@pvary pvary commented Nov 19, 2020

No description provided.

@github-actions github-actions bot added the build label Nov 19, 2020
rdblue commented Nov 19, 2020

Looks like the log is too long with this enabled:

This file has been truncated. View the full logs in the ... menu once the check is completed.

Maybe we can output logs to files for each module and fetch them when we need them? We can look into getting the files produced.

@github-actions github-actions bot added the INFRA label Nov 19, 2020
@github-actions github-actions bot added the MR label Nov 19, 2020
pvary commented Nov 20, 2020

Looks like the log is too long with this enabled:

This file has been truncated. View the full logs in the ... menu once the check is completed.

It is still possible to see the logs by downloading them from the ... menu in the top right corner of the Details screen.
But I absolutely agree that this is not ideal, since we might need to download the full logs to find the failures. That is why I followed up on your suggestion below.

Maybe we can output logs to files for each module and fetch them when we need them? We can look into getting the files produced.

I was able to come up with a solution where I write the test output to subproject-specific files and add them as artifacts for the build in case of a failure. The archive is accessible under the Artifacts button. See: https://github.com/apache/iceberg/runs/1431325389 (I intentionally created some test failures)

Sample output (build/testlogs/iceberg-parquet.log):

--------
- Test log for: Test testRowGroupSizeConfigurableWithWriter(org.apache.iceberg.parquet.TestParquet)
--------
StdErr log4j:WARN No appenders could be found for logger (org.apache.hadoop.util.NativeCodeLoader).
StdErr log4j:WARN Please initialize the log4j system properly.
StdErr log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
--------
- Test log for: Test testListProjection(org.apache.iceberg.avro.TestParquetReadProjection)
--------
StdErr [Test worker] INFO org.apache.parquet.hadoop.InternalParquetRecordReader - RecordReader initialized will read a total of 1 records.
StdErr [Test worker] INFO org.apache.parquet.hadoop.InternalParquetRecordReader - at row 0. reading next block
StdErr [Test worker] INFO org.apache.parquet.hadoop.InternalParquetRecordReader - block read in memory in 1 ms. row count = 1

Since I think this could be useful every time we run the tests, I have configured this logging not only for CI runs but for general test runs as well - some might disagree, so please check this.
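For readers who want to see how such per-subproject log files can be produced, here is a minimal sketch using Gradle's built-in `Test` task hooks (`beforeTest`, `onOutput`). The file layout mirrors the `build/testlogs/<project>.log` naming above, but the code itself is illustrative, not the PR's exact implementation:

```groovy
// Illustrative sketch only: write each subproject's test output to
// build/testlogs/<project>.log using Gradle's Test task listeners.
subprojects {
  test {
    // One log file per subproject, matching the layout described above.
    def logFile = rootProject.file("build/testlogs/${project.name}.log")
    doFirst {
      logFile.parentFile.mkdirs()
      logFile.text = ''
    }
    // Write a header before every test, like the "Test log for:" blocks above.
    beforeTest { descriptor ->
      logFile << "--------\n- Test log for: ${descriptor}\n--------\n"
    }
    // Append captured StdOut/StdErr lines as they are produced.
    onOutput { descriptor, event ->
      logFile << "${event.destination} ${event.message}"
    }
  }
}
```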

Thanks,
Peter

with:
  name: test logs
  path: |
    **/build/testlogs
I think we just need to archive **/build/reports. The reports have the full stack traces and usually include stderr/stdout for each test. That may be easier than creating a separate file, since I think we already aggregate the logs there.
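In the workflow, this suggested alternative could look roughly like the following; a sketch assuming the `actions/upload-artifact` action, with the step name and version chosen for illustration:

```yaml
# Illustrative sketch: archive Gradle's test reports on failure instead of
# custom per-subproject log files.
- name: Upload test reports
  uses: actions/upload-artifact@v2
  if: failure()
  with:
    name: test reports
    path: |
      **/build/reports
```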


rdblue commented Nov 20, 2020

@pvary, looks great! Thanks for looking into the archive options. I think we may be able to do this more simply by archiving reports instead of creating new log files. The reports I've used have had stderr/stdout.

pvary commented Nov 20, 2020

@pvary, looks great! Thanks for looking into the archive options. I think we may be able to do this more simply by archiving reports instead of creating new log files. The reports I've used have had stderr/stdout.

Thanks for reviewing @rdblue!
I have tried archiving reports as a first option. You can see the results here in the artifact: https://github.com/apache/iceberg/runs/1429922302

I found them lacking for two reasons:

  1. Hive often does not propagate the exception to the client (the theory is that infra errors should not be handled by the users), so Hive-based tests produce exceptions like the following, without enough detail to investigate (the INFO logs contain the real exception):
org.apache.iceberg.mr.hive.TestHiveIcebergStorageHandlerWithCustomCatalog > testScanTable[fileFormat=PARQUET, engine=tez] FAILED
    java.lang.IllegalArgumentException: Failed to execute Hive query 'SELECT * FROM default.customers ORDER BY customer_id DESC': Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.tez.TezTask
        Caused by:
        org.apache.hive.service.cli.HiveSQLException: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.tez.TezTask
  2. Having a separate directory of test results for every subproject does not help in identifying the failing test at a glance.

As a second option I tried the official solution for aggregating the results, but it only does the aggregation if there is no test failure, which is less than ideal 😄

That is why I decided to backtrack to the "have a file per subproject and keep the original test output" solution.

After my trip into test output aggregation, I feel that if we want aggregated test results, that should be done in a different PR; here we should concentrate on archiving the logs.


rdblue commented Nov 20, 2020

I thought that logs printed to stderr and stdout were available in the test reports. Maybe I'm wrong though.

For the second problem, I thought that you'd usually know which test had failed and go looking for its results.

@rdblue rdblue merged commit 275ec9a into apache:master Nov 20, 2020

rdblue commented Nov 20, 2020

I'll merge this so that we have something working to debug these cases. We can always improve on it later.
